Mediatory Summary Generation: Summary-Passage Extraction for Information Credibility on the Web
نویسندگان
چکیده
In this paper, we discuss the summarization for supporting a user’s judgment on the credibility of information on the Web. In general, if a statement contradicts another statement, the credibility of either of the statements decreases. However, these opposing statements may coexist under certain situations, and presenting such situations is helpful for a user’s judgment. Therefore, we focus on the coexistence of these opposing statements, and attempt to develop a system to generate survey reports that contain mediatory summaries, which are defined as passages extracted from Web documents in order to present situations in which these opposing statements can coexist. We describe the outline of the summarization system and describe how to improve the TextRank algorithm from the viewpoint of passage extraction for the system. From experimental results, we confirmed that the methods based on the improved TextRank algorithm can extract significant passages, which are actually considered as significant by human assessors, with higher precision than baseline methods.
منابع مشابه
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
In this paper, we propose a method for mediatory summarization, which is a novel technique for facilitating users’ assessments of the credibility of information on the Web. A mediatory summary is generated by extracting a passage from Web documents; this summary is generated on the basis of its relevance to a given query, fairness, and density of keywords, which are features of the summaries co...
متن کاملConstruction of Text Summarization Corpus for the Credibility of Information on the Web
Recently, the credibility of information on the Web has become an important issue. In addition to telling about content of source documents, indicating how to interpret the content, especially showing interpretation of the relation between statements appeared to contradict each other, is important for helping a user judge the credibility of information. In this paper, we will describe the purpo...
متن کاملA survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملAutomatic Text Extraction Based on Field Association Terms and Power Links
The existence of the World Wide Web has caused an information explosion. Readers are overloaded with lengthy text documents where a shorter version would suffice. Text summarization is the process of distilling the most important information from a source to produce an abridged version for a particular user and task. When this is done by means of a computer, i. e. automatically, it is called as...
متن کاملData Extraction using Content-Based Handles
In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. In our extraction algorithm, we inspired by the way a human user scans the page content for specific data. In particular, we use text fea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009